POISSON SPACING STATISTICS FOR VALUE SETS 
OF POLYNOMIALS 

PAR KURLBERG 

Abstract. If $f$ is a polynomial with integer coefficients and $q$ is an integer, we may regard $f$ as a map from $\mathbb{Z}/q\mathbb{Z}$ to $\mathbb{Z}/q\mathbb{Z}$. We show that the distribution of the (normalized) spacings between consecutive elements in the image of these maps becomes Poissonian as $q$ tends to infinity along any sequence of square free integers such that the mean spacing modulo $q$ tends to infinity.



1. Introduction 

Let $f$ be a polynomial with integer coefficients. Given an integer $q$, we may regard $f$ as a map from $\mathbb{Z}/q\mathbb{Z}$ to $\mathbb{Z}/q\mathbb{Z}$, and the image of this map will be called the image of $f$ modulo $q$. The purpose of this paper is to investigate the distribution of spacings between consecutive elements in the image of $f$ modulo $q$ as $q$ tends to infinity along square free integers. The main emphasis will be placed on the highly composite case, i.e., letting $q$ tend to infinity in such a way that the number of prime factors of $q$ also tends to infinity.

The case $f(x) = x^2$ and $q$ prime was investigated by Davenport. In [?] he proved that the probability of two consecutive squares being spaced $h$ units apart tends to $2^{-h}$ as $q \to \infty$. We may interpret this as saying that spacings between squares modulo a prime $q$ behave like gaps between heads in a sequence of fair coin flips.
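The coin flip heuristic can be checked numerically. The following is a minimal sketch, not taken from the paper: the function name `square_gap_frequencies` is hypothetical, and 0 is counted as a square. It tabulates the gaps between cyclically consecutive squares modulo a prime $p$ and prints them next to $2^{-h}$.

```python
from collections import Counter


def square_gap_frequencies(p):
    """Relative frequency of each gap between cyclically consecutive squares mod p.

    Illustrative helper (not from the paper); 0 is counted as a square.
    """
    squares = sorted({(x * x) % p for x in range(p)})
    n = len(squares)
    # Gap from each square to the next one, wrapping around modulo p.
    gaps = [(squares[(i + 1) % n] - squares[i]) % p for i in range(n)]
    counts = Counter(gaps)
    return {h: counts[h] / n for h in sorted(counts)}


if __name__ == "__main__":
    freq = square_gap_frequencies(10007)
    for h in range(1, 6):
        # Compare the observed frequency with the coin flip prediction 2^(-h).
        print(h, round(freq.get(h, 0.0), 4), 2.0 ** (-h))
```

For a small prime the output is easy to verify by hand: modulo 7 the squares are 0, 1, 2, 4, so the gaps are 1, 1, 2 and a wrap-around gap of 3.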

The case $f(x) = x^2$ and $q$ highly composite was studied by Rudnick and the author in [14]. If we let $\omega(q)$ be the number of distinct prime factors of $q$, then the number of squares modulo $q$ equals $\prod_{p|q} \frac{p+1}{2}$, and the average spacing between the squares is given by

$s_q = \frac{q}{\prod_{p|q} \frac{p+1}{2}} = 2^{\omega(q)} \prod_{p|q} \frac{p}{p+1}.$
Hence $s_q \to \infty$ as $\omega(q) \to \infty$, so we would expect that the probability of two squares being 1 unit apart vanishes as $\omega(q) \to \infty$, and it is thus natural to normalize so that the mean spacing is one. A natural statistical model for the spacings is then given by looking at random points in $\mathbb{R}/\mathbb{Z}$; for independent uniformly distributed numbers in $\mathbb{R}/\mathbb{Z}$, the normalized spacings are said to be Poissonian. In particular, the distribution $P(s)$ of spacings between consecutive points is that of a Poisson arrival process, i.e., $P(s) = e^{-s}$, and the joint distribution of $l$ consecutive spacings is a product of $l$ independent exponential random variables (see [?]). Using Davenport's result together with the heuristic that "primes are independent", it seems reasonable to expect that the distribution of the normalized spacings between squares modulo $q$ becomes Poissonian in the limit $s_q \to \infty$, and the main result of [14] is that this is indeed the case for squarefree $q$ (the general case is treated in [?]).
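As a sanity check on the mean spacing of squares, here is a small sketch (helper names are hypothetical, and $q$ is assumed odd and square free) that compares the brute-force value $q/|\{x^2 \bmod q\}|$ with the closed form $2^{\omega(q)} \prod_{p|q} p/(p+1)$, using exact rational arithmetic.

```python
from fractions import Fraction


def prime_factors(q):
    """Distinct prime factors of q by trial division (fine for small q)."""
    factors, d = [], 2
    while d * d <= q:
        if q % d == 0:
            factors.append(d)
            while q % d == 0:
                q //= d
        d += 1
    if q > 1:
        factors.append(q)
    return factors


def mean_square_spacing(q):
    """Brute-force mean spacing q / (number of squares modulo q)."""
    return Fraction(q, len({(x * x) % q for x in range(q)}))


def predicted_spacing(q):
    """Closed form 2^omega(q) * prod p/(p+1); assumes q odd and square free."""
    result = Fraction(1)
    for p in prime_factors(q):
        result *= Fraction(2 * p, p + 1)
    return result


if __name__ == "__main__":
    for q in (3 * 5, 3 * 5 * 7, 3 * 5 * 7 * 11):
        print(q, mean_square_spacing(q), predicted_spacing(q))
```

The spacing grows with the number of prime factors, which is the reason for normalizing to mean spacing one.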

What can be said about more general polynomials $f \in \mathbb{Z}[x]$? For $p$ prime, let

$\Omega_p := \{t \in \mathbb{Z}/p\mathbb{Z} : t = f(x) \text{ for some } x \in \mathbb{Z}/p\mathbb{Z}\}$

be the image of $f$ modulo $p$. Given $k \ge 2$ and integers $h_1, h_2, \ldots, h_{k-1}$, let

$N_k((h_1, h_2, \ldots, h_{k-1}), p) := |\{t \in \Omega_p : t + h_1, \ldots, t + h_{k-1} \in \Omega_p\}|$

be the counting function for the number of $k$-tuples of elements in the image of the form $t, t + h_1, \ldots, t + h_{k-1}$. Letting $s_p := p/|\Omega_p|$ denote the average gap modulo $p$, the "probability" of an element being in the image is $1/s_p$. Thus, if the conditions $t \in \Omega_p, t + h_1 \in \Omega_p, \ldots, t + h_{k-1} \in \Omega_p$ are independent, we would expect $N_k((h_1, h_2, \ldots, h_{k-1}), p)$ to be of size $p/s_p^k$, and a natural analogue of Davenport's result is then that

(1)  $N_k((h_1, h_2, \ldots, h_{k-1}), p) = p/s_p^k + o(p)$

as $p \to \infty$ provided that $0, h_1, \ldots, h_{k-1}$ are distinct modulo $p$. In [?] Granville and the author proved that

(2)  $N_k((h_1, h_2, \ldots, h_{k-1}), p) = p/s_p^k + O_{f,k}(\sqrt{p})$

holds if $f$ is a Morse polynomial and $0, h_1, \ldots, h_{k-1}$ are distinct modulo $p$. Using this, Poisson spacings for the image of Morse polynomials in the highly composite case follow from the following criterion (see [?], Theorem 1): Assume that there exists $\epsilon > 0$ such that for each integer $k \ge 2$,

(3)  $N_k((h_1, h_2, \ldots, h_{k-1}), p) = \frac{p}{s_p^k} \left(1 + O_k\left((1 - s_p^{-1}) p^{-\epsilon}\right)\right)$

provided that $0, h_1, h_2, \ldots, h_{k-1}$ are distinct mod $p$. If $s_p = p^{o(1)}$ for all primes $p$, then the spacings modulo $q$ become Poisson distributed as $s_q$, the mean spacing modulo $q$, tends to infinity.
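To make the counting function concrete, here is a brute-force sketch. The names `image_mod` and `count_tuples` are illustrative, not from the paper, and polynomials are given as coefficient lists with the constant term first. It computes $N_k((h_1, \ldots, h_{k-1}), p)$ directly from the definition and prints it next to the heuristic size $p/s_p^k = |\Omega_p|^k/p^{k-1}$.

```python
def image_mod(coeffs, m):
    """Image of the polynomial sum(coeffs[i] * x^i) as a map Z/mZ -> Z/mZ."""
    return {sum(c * pow(x, i, m) for i, c in enumerate(coeffs)) % m for x in range(m)}


def count_tuples(coeffs, h, p):
    """N_k(h, p): number of t in the image with t + h_i in the image for every shift h_i."""
    omega = image_mod(coeffs, p)
    return sum(1 for t in omega if all((t + hi) % p in omega for hi in h))


if __name__ == "__main__":
    coeffs, p = [0, 0, 1], 1009          # f(x) = x^2, an illustrative choice
    omega_size = len(image_mod(coeffs, p))
    for h in [(1,), (1, 2), (1, 2, 3)]:
        k = len(h) + 1
        heuristic = omega_size ** k / p ** (k - 1)   # p / s_p^k
        print(h, count_tuples(coeffs, h, p), round(heuristic, 1))
```

For example, modulo 7 the squares are {0, 1, 2, 4}; the pairs $t, t+1$ in the image are (0, 1) and (1, 2), and the only triple $t, t+1, t+2$ is (0, 1, 2).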

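What about non-Morse polynomials? Rather surprisingly, it turns out that (1) does not hold for all polynomials.^1 For example, in [?] it was shown that for $f(x) = x^4 - 2x^2$,

$N_2(h, p) = \begin{cases} \frac{2}{3} \cdot \frac{p}{s_p^2} + O(\sqrt{p}) & \text{if } h \equiv \pm 1 \bmod p,\ p \equiv 1 \bmod 4, \\ \frac{4}{3} \cdot \frac{p}{s_p^2} + O(\sqrt{p}) & \text{if } h \equiv \pm 1 \bmod p,\ p \equiv 3 \bmod 4, \\ \frac{p}{s_p^2} + O(\sqrt{p}) & \text{if } h \not\equiv \pm 1 \bmod p. \end{cases}$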

Hence the assumptions in (3) are violated. However, we can prove that (2) holds for most values of $(h_1, h_2, \ldots, h_{k-1})$:

Theorem 1. Let $p$ be a prime and let

(4)  $R_p := \{f(\xi) : f'(\xi) = 0,\ \xi \in \overline{\mathbb{F}}_p\}$

be the set of critical values modulo $p$. If the sets $R_p, R_p - h_1, R_p - h_2, \ldots, R_p - h_{k-1}$ are pairwise disjoint,^2 then

(5)  $N_k((h_1, h_2, \ldots, h_{k-1}), p) = p/s_p^k + O_{f,k}(\sqrt{p})$

In other words, the analogue of Davenport's result holds for all but $O(p^{k-2})$ elements in $(\mathbb{Z}/p\mathbb{Z})^{k-1}$. Allowing for overlap between two translates of the set of critical values, we also have the following weaker upper bound on $N_k((h_1, h_2, \ldots, h_{k-1}), p)$:

Proposition 2. Let $p$ be a prime. There exists a constant $C_0 < 1$, only depending on $f$, with the following property: if the sets

$(R_p \cup R_p - h_1),\ R_p - h_2,\ \ldots,\ R_p - h_{k-1}$

are pairwise disjoint and $h_1 \not\equiv 0 \bmod p$, then

$N_k((h_1, h_2, \ldots, h_{k-1}), p) \le \frac{C_0}{s_p^{k-1}} \cdot p + O_{f,k}(\sqrt{p})$
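The role of the critical values can be illustrated on the example $f(x) = x^4 - 2x^2$. The sketch below uses hypothetical helper names and assumes $p$ is an odd prime. Since $f'(x) = 4x(x-1)(x+1)$, the critical points $0, 1, -1$ all lie in $\mathbb{F}_p$, so $R_p = \{0, -1\}$, and $R_p$ meets $R_p - h$ for nonzero $h$ exactly when $h \equiv \pm 1 \bmod p$. The script also prints $N_2(h,p)$ divided by the heuristic main term $p/s_p^2$ for a few shifts.

```python
def f(x, p):
    """f(x) = x^4 - 2x^2 reduced modulo p."""
    return (pow(x, 4, p) - 2 * x * x) % p


def image(p):
    return {f(x, p) for x in range(p)}


def critical_values(p):
    # f'(x) = 4x(x - 1)(x + 1); for odd p the critical points are 0, 1, -1.
    return {f(x, p) for x in (0, 1, p - 1)}


def translates_disjoint(p, h):
    """True when R_p and R_p - h have no common element."""
    r = critical_values(p)
    return r.isdisjoint({(v - h) % p for v in r})


def n2(p, h):
    """N_2(h, p): number of t in the image such that t + h is also in the image."""
    omega = image(p)
    return sum(1 for t in omega if (t + h) % p in omega)


if __name__ == "__main__":
    for p in (10009, 10007):             # 10009 = 1 mod 4, 10007 = 3 mod 4
        main_term = len(image(p)) ** 2 / p   # p / s_p^2
        for h in (1, 2, 3):
            print(p, h, translates_disjoint(p, h), round(n2(p, h) / main_term, 3))
```

Only the shifts where the translates of $R_p$ overlap should show a ratio far from 1; the script reports the observed ratios rather than asserting them.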

It turns out that these two results are enough to obtain Poisson spacings in the highly composite case. However, rather than studying the spacings directly, we proceed by determining the $k$-level correlation functions. Let

$\Omega_q := \{t \in \mathbb{Z}/q\mathbb{Z} : t = f(x) \text{ for some } x \in \mathbb{Z}/q\mathbb{Z}\}$

be the image of $f$ modulo $q$, let

$s_q := q/|\Omega_q|$

be the mean spacing modulo $q$, and given $h = (h_1, h_2, \ldots, h_{k-1}) \in \mathbb{Z}^{k-1}$ put

$N_k(h, q) := |\{t \in \Omega_q : t + h_1, t + h_2, \ldots, t + h_{k-1} \in \Omega_q\}|$

For $X \subset \mathbb{R}^{k-1}$, the $k$-level correlation function is then given by

$R_k(X, q) := \frac{1}{|\Omega_q|} \sum_{h \in s_q X \cap \mathbb{Z}^{k-1}} N_k(h, q)$

^1 In particular, the spacing distribution for the image of such polynomials is not consistent with the coin flip model! (That is, independent coin flips where the probability of heads is given by $|\Omega_p|/p$.)

^2 In the case $f(x) = x^2$ this condition is equivalent to $0, h_1, \ldots, h_{k-1}$ being distinct modulo $p$. However, for general polynomials (including the case of Morse polynomials), the two conditions are not equivalent.

The main result of this paper is then the following:

Theorem 3. Let $q$ be square free, $k \ge 2$ an integer, and let $X \subset \mathbb{R}^{k-1}$ be a convex set with the property that $(x_1, x_2, \ldots, x_{k-1}) \in X$ implies that $x_i \ne x_j$ if $i \ne j$. Then the $k$-level correlation function of the image of $f$ modulo $q$ satisfies



as $s_q \to \infty$, where $C_0 < 1$ is the constant given in Proposition 2.

Using a standard inclusion-exclusion argument (see [13], appendix A for details), this implies that the spacing statistics are Poissonian. In particular we have the following:

Theorem 4. For $q$ square free, the limiting (normalized) spacing distribution^3 of the image of $f$ modulo $q$ is given by $P(t) = \exp(-t)$ as $s_q \to \infty$. Moreover, for any integer $k \ge 2$, the limiting joint distribution of $k$ consecutive spacings is a product $\prod_{i=1}^{k} \exp(-t_i)$ of $k$ independent exponential variables.

^3 By normalized spacings we mean the following: with $0 \le x_1 < x_2 < \cdots < x_{|\Omega_q|} < q$ being integer representatives of the image of $f$ modulo $q$, the spacings between consecutive elements are defined to be $\Delta_i = x_{i+1} - x_i$ for $1 \le i < |\Omega_q|$, and $\Delta_{|\Omega_q|} = x_1 - x_{|\Omega_q|} + q$. The normalized spacings are then given by $\tilde{\Delta}_i := \Delta_i/s_q$.
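The normalized spacings of the image of $f$ modulo $q$ can be computed directly for small $q$. The following is a minimal sketch under stated assumptions: the function names are hypothetical, polynomials are coefficient lists with the constant term first, and the modulus is an arbitrary illustrative choice. It builds the normalized spacings with exact fractions and compares their empirical distribution function with $1 - e^{-t}$.

```python
import math
from fractions import Fraction


def normalized_spacings(coeffs, q):
    """Normalized spacings of the image of f mod q, including the wrap-around gap."""
    image = sorted({sum(c * pow(x, i, q) for i, c in enumerate(coeffs)) % q
                    for x in range(q)})
    n = len(image)
    mean_spacing = Fraction(q, n)                      # s_q = q / |image|
    gaps = [image[i + 1] - image[i] for i in range(n - 1)]
    gaps.append(image[0] - image[-1] + q)              # last gap wraps around
    return [Fraction(g) / mean_spacing for g in gaps]


def empirical_cdf(spacings, t):
    """Fraction of normalized spacings that are at most t."""
    return sum(1 for s in spacings if s <= t) / len(spacings)


if __name__ == "__main__":
    q = 3 * 5 * 7 * 11 * 13 * 17                       # square free, illustrative
    spacings = normalized_spacings([0, 0, 1], q)       # f(x) = x^2
    for t in (0.5, 1.0, 2.0):
        print(t, round(empirical_cdf(spacings, t), 3), round(1 - math.exp(-t), 3))
```

By construction the spacings sum to $q$, so the normalized spacings always have mean exactly one; how close the empirical distribution is to the exponential law depends on how large the mean spacing is.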
1.1. Some remarks on the mean spacing. We note that the only way to have $s_p = 1$ for all primes $p$ is for $f(x)$ to be of degree one. However, there are nonlinear polynomials $f$ such that $s_p = 1$ for infinitely many primes. For example, if $f(x) = x^3$ and we take $q$ to be a product of primes $p \equiv 2 \bmod 3$, then $s_p = 1$ for all $p|q$, and $s_q = \prod_{p|q} s_p = 1$ clearly does not tend to infinity. On the other hand, if $\deg(f) > 1$, there is always a positive density set of primes $p$ such that $s_p > 1$. Moreover, if $f$ is not a permutation polynomial^4 modulo $p$, Wan has shown [?] that

(6)  $|\Omega_p| \le p - \frac{p-1}{\deg(f)}$

Thus, for primes $p$ such that $s_p > 1$, $s_p$ is in fact uniformly bounded away from 1.
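The dichotomy between permutation polynomials and the bound (6) is easy to observe numerically. The sketch below uses hypothetical helper names; coefficient lists have the constant term first, and the leading coefficient is assumed nonzero modulo $p$. It checks $f(x) = x^3$ for a few primes: for $p \equiv 2 \bmod 3$ the map is a bijection, and otherwise the image size respects $|\Omega_p| \le p - (p-1)/\deg(f)$.

```python
def image_size(coeffs, p):
    """Size of the image of the polynomial with the given coefficients modulo p."""
    return len({sum(c * pow(x, i, p) for i, c in enumerate(coeffs)) % p
                for x in range(p)})


def is_permutation_polynomial(coeffs, p):
    return image_size(coeffs, p) == p


def wan_bound_holds(coeffs, p):
    """Check |image| <= p - (p - 1)/deg(f), written without division."""
    degree = len(coeffs) - 1
    return degree * (p - image_size(coeffs, p)) >= p - 1


if __name__ == "__main__":
    cube = [0, 0, 0, 1]                  # f(x) = x^3
    for p in (5, 7, 11, 13, 17, 19):
        print(p, p % 3, image_size(cube, p),
              is_permutation_polynomial(cube, p), wan_bound_holds(cube, p))
```

For $x^2$ modulo 7 the image has 4 elements and the bound $7 - 6/2 = 4$ is attained, so (6) cannot be improved in general.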

It is also worth noting that Birch and Swinnerton-Dyer have shown [1] that for $f$ Morse, $|\Omega_p| = c_f \cdot p + O_f(\sqrt{p})$ where $c_f < 1$ only depends on the degree of $f$, hence $s_p = 1/c_f + O(p^{-1/2})$ for all $p$, and thus $s_q \to \infty$ as $\omega(q) \to \infty$.
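For orientation, the constant for Morse polynomials of degree $n$ is classically $1 - \frac{1}{2!} + \frac{1}{3!} - \cdots \pm \frac{1}{n!}$, which is the proportion of permutations in $S_n$ with at least one fixed point. The sketch below assumes that the relevant Galois group is the full symmetric group $S_n$ (an assumption stated here for illustration, with hypothetical function names) and verifies the identity by enumeration for small $n$.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial


def fixed_point_proportion(n):
    """Proportion of permutations of {0, ..., n-1} with at least one fixed point."""
    hits = sum(1 for perm in permutations(range(n))
               if any(perm[i] == i for i in range(n)))
    return Fraction(hits, factorial(n))


def alternating_series(n):
    """1 - 1/2! + 1/3! - ... up to the term 1/n!."""
    return sum(Fraction((-1) ** (j - 1), factorial(j)) for j in range(1, n + 1))


if __name__ == "__main__":
    for n in range(2, 8):
        c = fixed_point_proportion(n)
        # 1/c approximates the limiting mean spacing s_p under the S_n assumption.
        print(n, c, alternating_series(n), float(1 / c))
```

The proportion tends to $1 - 1/e$ quickly, so under this assumption the mean spacing modulo each prime stays close to $1/(1 - 1/e) \approx 1.58$.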

1.2. Related results. There are only a few other cases for which Poisson spacings have been proven. Notable examples are Hooley's result [11, 12] on invertible elements modulo $q$ under the assumption that the average gap $s_q = q/\phi(q)$ tends to infinity, and the work by Cobeli and Zaharescu [3] on spacings between primitive roots modulo $p$, again under the assumption that the average gap $s_p = (p-1)/\phi(p-1)$ tends to infinity. Recently, Cobeli, Vajaitu, and Zaharescu [2] extended Hooley's results and showed that subsets of the form $\{x \bmod q : x \in I_q, x^{-1} \in J_q\}$ have limiting Poisson spacings if the intervals $I_q, J_q$ have large lengths (more precisely, that $|I_q| \in [q^{1 - 2/(9 (\log\log q)^{1/2})}, q]$, and $|J_q| \in [q^{1 - 1/(\log\log q)}, q]$) as $q$ tends to infinity along a subsequence of integers such that $q/\phi(q) \to \infty$.

1.3. Acknowledgements. The author would like to thank Juliusz Brzeziński, Andrew Granville, Moshe Jarden, Zeev Rudnick, and Thomas J. Tucker for helpful discussions.

2. Proof of Theorem 1

Given a polynomial $f \in \mathbb{F}_p[x]$ and $k$ distinct elements $h_0 = 0, h_1, h_2, \ldots, h_{k-1} \in \mathbb{F}_p$, we wish to count the number of $t \in \mathbb{F}_p$ for which there exist $x_0, x_1, \ldots, x_{k-1} \in \mathbb{F}_p$ such that

$f(x_0) = t + h_0,\ f(x_1) = t + h_1,\ f(x_2) = t + h_2,\ \ldots,\ f(x_{k-1}) = t + h_{k-1}$

^4 $f$ is said to be a permutation polynomial modulo $p$ if $|\Omega_p| = p$.

In order to study this, put

$h := (h_1, h_2, \ldots, h_{k-1})$

and let $X_{k,h}$ be the affine curve defined by

$X_{k,h} := \{f(x_0) = t,\ f(x_1) = t + h_1,\ \ldots,\ f(x_{k-1}) = t + h_{k-1}\},$

and let $\mathbb{F}_p[X_{k,h}]$ be the coordinate ring of $X_{k,h}$. We then have

(7)  $N_k((h_1, h_2, \ldots, h_{k-1}), p) = |\{\mathfrak{m} \in \mathbb{F}_p[t] : \mathfrak{M} | \mathfrak{m} \text{ for some degree one prime } \mathfrak{M} \in \mathbb{F}_p[X_{k,h}]\}|$

In order to estimate the size of this set, we will use the Chebotarev density theorem, made effective via the Riemann hypothesis for curves, for the Galois closure of $\mathbb{F}_p[X_{k,h}]$. Thus, let $Y_{k,h}$ be the curve whose function field $\mathbb{F}_p(Y_{k,h})$ corresponds to the Galois closure of the extension $\mathbb{F}_p(X_{k,h})/\mathbb{F}_p(t)$.

We begin with the case $k = 1$. Given $h \in \mathbb{F}_p$, define a polynomial $F_h \in \mathbb{F}_p[x, t]$ by

$F_h(x, t) := f(x) - (t + h).$

Since the $t$-degree of $F_h$ is one, $F_h$ is irreducible, and thus $K_h = \mathbb{F}_p[x, t]/F_h(x, t)$ is a field. Let $L_h$ be the Galois closure of $K_h$, and let $G_h = \mathrm{Gal}(L_h/\mathbb{F}_p(t))$ be the Galois group of the field extension $L_h/\mathbb{F}_p(t)$. By allowing for worse constants in the error terms, we may assume that $p > n$, where $n = \deg(f)$, so that all field extensions are separable, and no wild ramification can occur.

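The following Lemma shows that $G_h$ and $L_h \cap \overline{\mathbb{F}}_p$ are independent of $h$.

Lemma 5. Let $h \in \mathbb{F}_p$. Then $G_h \cong G_0$ and $L_h \cap \overline{\mathbb{F}}_p = L_0 \cap \overline{\mathbb{F}}_p$.

Proof. Define an $\mathbb{F}_p$-linear automorphism $\sigma : \mathbb{F}_p[t] \to \mathbb{F}_p[t]$ by $\sigma(t) = t + h$. Since $\sigma(F_0) = F_h$ we may extend $\sigma$ to an isomorphism $\sigma' : L_0 \to L_h$. Moreover, given $\tau \in G_0$, we have $\sigma' \tau (\sigma')^{-1} \in G_h$, and the map $\tau \to \sigma' \tau (\sigma')^{-1}$ gives an isomorphism between $G_0$ and $G_h$.

Let $l_0 = L_0 \cap \overline{\mathbb{F}}_p$ and let $l_h = L_h \cap \overline{\mathbb{F}}_p$. Since $l_0/\mathbb{F}_p$ is normal, $l_0 = \sigma'(l_0) \subset L_h \cap \overline{\mathbb{F}}_p = l_h$, and the same argument for $(\sigma')^{-1}$ gives that $l_h \subset l_0$, hence $l_h = l_0$. □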

Thus

$l := L_0 \cap \overline{\mathbb{F}}_p$

is the field of constants for $L_h$ for any $h \in \mathbb{F}_p$. Arguing as in the proof of Lemma 5 we obtain:

Lemma 6. Let $H_h := \mathrm{Gal}(L_h/l(t))$. Then $H_h \cong H_0$.

Our next goal is to obtain a criterion for linear disjointness for the field extensions $L_h/l(t)$ as $h$ varies.

Lemma 7. Let $E_1, E_2$ be finite extensions of $\mathbb{F}_p(t)$, both having the same constant field $l$, and degree smaller than $p$. If $E_1/l(t)$ and $E_2/l(t)$ have disjoint finite ramification, then $E_1 \cap E_2 = l(t)$.

Proof. Let $E = E_1 \cap E_2$. By the assumption, $E/l(t)$ can only ramify at infinity. Moreover, the ramification must be tame. With $g_E$ denoting the genus of $E$, the Riemann-Hurwitz genus formula now gives

$-2 \le 2(g_E - 1) = [E : l(t)] \cdot 2(0 - 1) + \sum_{\mathfrak{P} | \infty} (e(\mathfrak{P}/\infty) - 1) \deg(\mathfrak{P})$

$= -2[E : l(t)] + [E : l(t)] - \sum_{\mathfrak{P} | \infty} \deg(\mathfrak{P}) < -[E : l(t)]$

and thus $[E : l(t)] < 2$. □

We now easily obtain the desired criterion for linear disjointness.

Proposition 8. If the sets $R_p, R_p - h_1, R_p - h_2, \ldots, R_p - h_j$ are pairwise disjoint, then the field extensions $L_0/l(t), L_{h_1}/l(t), \ldots, L_{h_j}/l(t)$ are linearly disjoint.

Proof. Since $L_h$ is the Galois closure of $K_h$, both extensions, relative to $\mathbb{F}_p(t)$, ramify over the same primes. The assumption of pairwise disjointness of $R_p, R_p - h_1, \ldots, R_p - h_j$ means that there is no common finite ramification among the fields $L_0, L_{h_1}, \ldots, L_{h_j}$, hence any intersection of compositums of the fields must, by Lemma 7 and Lemma 5, equal $l(t)$. □

If $G = \mathrm{Gal}(E/\mathbb{F}_p(t))$ is the Galois group of an extension $E/\mathbb{F}_p(t)$ with constant field $l$, define (following Cohen [?])

$G^* := \{\sigma \in G : \sigma|_{l(t)} = \mathrm{Frob}(l(t)/\mathbb{F}_p(t))\}$

where $\mathrm{Frob}(l(t)/\mathbb{F}_p(t))$ is the canonical generator of $\mathrm{Gal}(l(t)/\mathbb{F}_p(t))$ given by $x \to x^p$.

Let $L^k = \mathbb{F}_p(Y_{k,h})$ be the compositum of the fields $L_{h_0}, L_{h_1}, \ldots, L_{h_{k-1}}$. For $k \ge 2$, define a conjugacy class $\mathrm{Fix}_{k,h} \subset \mathrm{Gal}(L^k/\mathbb{F}_p(t))^*$ by

$\mathrm{Fix}_{k,h} := \{\sigma \in \mathrm{Gal}(L^k/\mathbb{F}_p(t))^* : \sigma \text{ fixes at least one root of } F_{h_i} \text{ for } i = 0, 1, \ldots, k-1\}$

For $k = 1$ we define (note that there is no dependence on $h$) a conjugacy class $\mathrm{Fix}_1 \subset \mathrm{Gal}(L^1/\mathbb{F}_p(t))^*$ by

$\mathrm{Fix}_1 := \{\sigma \in \mathrm{Gal}(L^1/\mathbb{F}_p(t))^* : \sigma \text{ fixes at least one root of } f(x) - t\}$

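Then, taking into account $O_{k,f}(1)$ ramified primes, we have

(8)  $N_k(h, p) = |\{\mathfrak{m} \in \mathbb{F}_p[t] : \deg(\mathfrak{m}) = 1,\ \exists\, \mathfrak{M} | \mathfrak{m},\ \mathfrak{M} \subset \mathbb{F}_p[Y_{k,h}],\ \mathrm{Frob}(\mathfrak{M}|\mathfrak{m}) \in \mathrm{Fix}_{k,h}\}| + O_{k,f}(1)$

where $\mathrm{Frob}(\mathfrak{M}|\mathfrak{m}) \in \mathrm{Gal}(L^k/\mathbb{F}_p(t))$ denotes the Frobenius automorphism. Applying the Chebotarev density theorem (e.g., see [?], Proposition 5.16), we obtain

$N_k(h, p) = \frac{|\mathrm{Fix}_{k,h}|}{|\mathrm{Gal}(L^k/l(t))|} \cdot p + O_{k,f}(\sqrt{p})$

Our next goal is to determine $|\mathrm{Fix}_{k,h}|/|\mathrm{Gal}(L^k/l(t))|$.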
Lemma 9. Given $k \ge 2$, define

$C_k(h, p) := \frac{|\mathrm{Fix}_{k,h}|}{|\mathrm{Gal}(L^k/l(t))|},$

and

$C_1(p) := \frac{|\mathrm{Fix}_1|}{|\mathrm{Gal}(L^1/l(t))|}.$

Assume that $R_p, R_p - h_1, \ldots, R_p - h_{k-1}$ are pairwise disjoint. Then $C_k(h, p) = C_1(p)^k$ where $C_1(p) = 1/s_p + O_f(p^{-1/2})$.

Proof. For simplicity, we consider only the case $k = 2$, and for ease of notation, let $h = (h_1) = (h)$.

The action of $\mathrm{Gal}(L^2/\mathbb{F}_p(t))$ on the roots of $F_0$ and $F_h$ allows us to identify $\mathrm{Gal}(L^2/\mathbb{F}_p(t))$ and $\mathrm{Gal}(L^2/l(t))$ with subgroups of $S_n \times S_n$. Moreover, since $L_0$ and $L_h$ are linearly disjoint over $l(t)$ and have isomorphic Galois groups, we may identify $\mathrm{Gal}(L^2/l(t)) \cong H_0 \times H_h$ with a subgroup of $S_n \times S_n$ in such a way that

$H_0 \cong H' \times 1 \subset S_n \times 1 \subset S_n \times S_n$

and

$H_h \cong 1 \times H' \subset 1 \times S_n \subset S_n \times S_n$

where $H' \cong H_0 \cong H_h$ and $H'$ is a subgroup of $S_n$.

Define an $\mathbb{F}_p$-linear map $\tau : \mathbb{F}_p(t) \to \mathbb{F}_p(t)$ by $\tau(t) = t + h$, and extend it to a map from $L_0$ to $L_h$. Given $\mu_1 \in G_0^*$, let $\mu_2 = \tau \mu_1 \tau^{-1}$. Clearly $\mu_2 \in G_h$, and since $\mathrm{Gal}(l(t)/\mathbb{F}_p(t)) \cong \mathrm{Gal}(l/\mathbb{F}_p)$ is abelian, $\mu_1|_{l(t)} = \mu_2|_{l(t)}$ and hence $\mu_2 \in G_h^*$. Let us consider the possible extensions of $\mu_1, \mu_2$ to $L^2$. After making a fixed, but arbitrary, choice of extensions $\tilde{\mu}_1, \tilde{\mu}_2$ we find that all pairs of extensions are of the form $(\delta \tilde{\mu}_1, \gamma \tilde{\mu}_2)$ where $\delta \in H_h$ and $\gamma \in H_0$. Now, for any such pair of extensions, we have

$\delta \tilde{\mu}_1 (\gamma \tilde{\mu}_2)^{-1} = \delta \tilde{\mu}_1 \tilde{\mu}_2^{-1} \gamma^{-1} \in \mathrm{Gal}(L^2/l(t))$

But since $\mathrm{Gal}(L^2/l(t)) \cong H_0 \times H_h$ we may choose $\gamma$ and $\delta$ in such a way that $\delta \tilde{\mu}_1 \tilde{\mu}_2^{-1} \gamma^{-1} = 1$. In other words, it is possible to choose $\tilde{\mu}_1, \tilde{\mu}_2$ so that $\tilde{\mu}_1 = \tilde{\mu}_2$.

Thus, there is an extension of $\mu \in G_0^*$ to an element $\tilde{\mu}$ of $\mathrm{Gal}(L^2/\mathbb{F}_p(t))^*$ in such a way that $\tilde{\mu}$ embeds diagonally when regarded as an element of $S_n \times S_n$, i.e., there exists $\sigma \in S_n$ such that $\tilde{\mu}$ corresponds to

$(\sigma, \sigma) \in S_n \times S_n$

Now, all elements of $\mathrm{Gal}(L^2/\mathbb{F}_p(t))^*$, regarded as elements of $S_n \times S_n$, must be of the form

$(\delta\sigma, \gamma\sigma) \in S_n \times S_n$

where $\delta, \gamma \in H'$. In particular, if we let $H'' \subset H'$ be the set of elements $\delta$ such that $\delta\sigma$ has at least one fixed point, we find that

$C_2(h, p) = \frac{|\mathrm{Fix}_{2,h}|}{|\mathrm{Gal}(L^2/l(t))|} = \frac{|H''|^2}{|\mathrm{Gal}(L^2/l(t))|} = \frac{|H''|^2}{|\mathrm{Gal}(L^1/l(t))|^2} = C_1(p)^2$

since $\mathrm{Gal}(L^2/l(t)) \cong H_0 \times H_h$ and $H_h \cong H_0 = \mathrm{Gal}(L^1/l(t))$.

Finally, we note that

$|\Omega_p| = p/s_p = |\{t \in \mathbb{F}_p \text{ for which there exists } x \in \mathbb{F}_p \text{ such that } f(x) = t\}| = C_1(p) \cdot p + O_f(\sqrt{p})$

and thus $C_1(p) = 1/s_p + O_f(p^{-1/2})$. □

3. Proof of Proposition 2

We will begin by giving a proof for the case $k = 2$, and then show how the general case can be reduced to this case. We will be using the same notation as in the proof of Theorem 1 and, by allowing worse constants in the error terms as before, we may assume that $p > \deg(f)$.

3.1. The case $k = 2$. We start by showing that the field extensions $K_0, K_h$ are disjoint if $h \ne 0$.

Lemma 10. Let $f \in \mathbb{F}_p[x]$ be a polynomial of degree smaller than $p$. Then the affine curve defined by

$\{(x, y) : f(x) - (f(y) + h) = 0\}$

is absolutely irreducible if $h \not\equiv 0 \bmod p$.
Proof. Let $x, y$ be roots of $f(x) = t$ and $f(y) = t + h$ where $t$ is transcendental over $\mathbb{F}_p$. If $\mathbb{F}_p(x)$ and $\mathbb{F}_p(y)$ are not linearly disjoint over $\mathbb{F}_p(t)$ then, by Lüroth's theorem, $\mathbb{F}_p(x) \cap \mathbb{F}_p(y) = \mathbb{F}_p(u)$ for some $u \notin \mathbb{F}_p(t)$. Hence there exist non-constant rational functions $g_1$ and $g_2$ such that $u = g_1(x) = g_2(y)$, and a rational function $q$, of degree less than $p$, such that $q(u) = t$. However, since $t = f(x) = q(u) = q(g_1(x))$ and $f$ is a polynomial, $q$ and $g_1$ must be of a special form: either $q$ and $g_1$ are both polynomials, or $g_1(x) = c_1 + c_2/b(x)$ where $c_1, c_2$ are constants, $b$ is a polynomial, and $q(u) = \sum_{i \ge 0} a_i/(u - c_1)^i$. In the latter case, we can replace $u$ by $\tilde{u} = c_2/(c_1 - u)$, and hence we may assume that $q$ and $g_1$ are in fact both polynomials. Similarly, since $q(g_2(y)) = t = f(y) - h$, we may assume that $g_2$ is a polynomial as well.

Now, since $t$ is transcendental, so is $y$, and therefore $q(g_2(y)) = t = f(y) - h$ implies that $q(g_2(x)) = f(x) - h$. Thus

$q(g_1(x)) - q(g_2(x)) = h$

and hence $g_1(x) - g_2(x)$ must divide $h$, which can only happen if $g_1(x) = g_2(x) + C$ for some constant $C \ne 0$. Thus

$q(g_2(x) + C) - q(g_2(x)) = h$

and hence

$q'(g_2(x) + C) g_2'(x) - q'(g_2(x)) g_2'(x) = 0$

which, since $g_2$ is non-constant, implies that

$q'(g_2(x) + C) = q'(g_2(x))$

Therefore, if $g_2(\alpha) = \beta$ where $q'(\beta) = 0$ we find that $q'(\beta + C) = q'(\beta) = 0$, and more generally, that $q'(\beta + lC) = 0$ for $l = 0, 1, \ldots, p - 1$, which is impossible since the degree of $q$ is smaller than $p$.

Thus, the two fields $\mathbb{F}_p(x, t)/(f(x) - t)$ and $\mathbb{F}_p(y, t)/(f(y) - t - h)$ are linearly disjoint over $\mathbb{F}_p(t)$ and hence $f(x) - (f(y) + h)$, when regarded as a polynomial over $\mathbb{F}_p(y)$, is irreducible. □

We are now ready to give a proof of Proposition 2 in the case $k = 2$.

Lemma 11. There exists $C_0 < 1$, only depending on $f$, with the following property: for all sufficiently large $p$ for which $f$ is not a permutation polynomial modulo $p$,

$C_2((h), p) \le C_0/s_p$

if $h \not\equiv 0 \bmod p$.
Proof. For $f$ fixed there are only finitely many possibilities for $\mathrm{Gal}(L^2/\mathbb{F}_p(t))$, hence $C_2((h), p) = |\mathrm{Fix}_{2,(h)}|/|\mathrm{Gal}(L^2/l(t))|$ can only take finitely many values. Thus, since $C_2((h), p) \le C_1(p) = 1/s_p + O_f(p^{-1/2})$ it is enough to show that $C_2((h), p) = C_1(p)$ can only happen for finitely many primes $p$.

Given $a \in \mathbb{F}_p$, let $M(a) = |\{x \in \mathbb{F}_p : f(x) = a\}|$. Then

$|\{x, y \in \mathbb{F}_p : f(x) = f(y) + h\}| = \sum_{a \in \mathbb{F}_p} M(a) M(a - h)$

On the other hand, by Lemma 10 the curve defined by $f(x) = f(y) + h$ is absolutely irreducible, and hence the Riemann hypothesis for curves gives that

$|\{x, y \in \mathbb{F}_p : f(x) = f(y) + h\}| = p + O_f(\sqrt{p})$

We have

$|\{a : M(a) > 0\}| = |\{a : M(a - h) > 0\}| = |\mathrm{Image}(f)| = p/s_p$

Thus, if

$N_2(h, p) = |\{a \in \mathbb{F}_p : M(a) > 0, M(a - h) > 0\}| = C_2(h, p) \cdot p + O_f(\sqrt{p}) = C_1(p) \cdot p + O_f(\sqrt{p}) = \frac{1}{s_p} \cdot p + O_f(\sqrt{p})$

then, since $|\{a : M(a - h) > 0\}| = |\mathrm{Image}(f)| = p/s_p$, we have

$|\{a \in \mathbb{F}_p : M(a) = 0, M(a - h) > 0\}| = O_f(\sqrt{p})$

Therefore

$p + O_f(\sqrt{p}) = \sum_{a \in \mathbb{F}_p} M(a) M(a - h)$

$\ge \sum_{a \in \mathbb{F}_p : M(a) = 1} M(a - h) + 2 \sum_{a \in \mathbb{F}_p : M(a) > 1} M(a - h)$

$= \sum_{a \in \mathbb{F}_p : M(a) > 0} M(a - h) + \sum_{a \in \mathbb{F}_p : M(a) > 1} M(a - h)$

$= \sum_{a \in \mathbb{F}_p} M(a - h) + \sum_{a \in \mathbb{F}_p : M(a) > 1} M(a - h) - \sum_{a \in \mathbb{F}_p : M(a) = 0} M(a - h)$

$= p + \sum_{a \in \mathbb{F}_p : M(a) > 1} M(a - h) - O_f(\sqrt{p})$

and thus

$\sum_{a \in \mathbb{F}_p : M(a) > 1} M(a - h) = O_f(\sqrt{p})$

Hence

$|\{a \in \mathbb{F}_p : M(a) > 1, M(a - h) > 0\}| = O_f(\sqrt{p})$

and we similarly obtain that

$|\{a \in \mathbb{F}_p : M(a) > 0, M(a - h) > 1\}| = O_f(\sqrt{p})$

But then

$p + O_f(\sqrt{p}) = \sum_{a \in \mathbb{F}_p} M(a) M(a - h) = |\{a \in \mathbb{F}_p : M(a) = M(a - h) = 1\}| + O_f(\sqrt{p})$

In other words, $M(a) = 1$ for all but $O_f(\sqrt{p})$ elements, which, by Wan's result (see (6), section 1.1), can only happen if $f$ is a bijection once $p$ is sufficiently large.

□ 
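The counting identity at the start of the proof can be verified by brute force. The sketch below is illustrative only: the helper names are hypothetical and coefficient lists have the constant term first. It counts the affine points on the curve $f(x) = f(y) + h$ over $\mathbb{F}_p$ in two ways, directly and via the fiber sizes $M(a)$.

```python
def evaluate(coeffs, x, p):
    """Value of the polynomial sum(coeffs[i] * x^i) modulo p."""
    return sum(c * pow(x, i, p) for i, c in enumerate(coeffs)) % p


def fiber_sizes(coeffs, p):
    """List M with M[a] = number of x in F_p such that f(x) = a."""
    sizes = [0] * p
    for x in range(p):
        sizes[evaluate(coeffs, x, p)] += 1
    return sizes


def curve_points_direct(coeffs, p, h):
    """Number of pairs (x, y) in F_p^2 with f(x) = f(y) + h."""
    values = [evaluate(coeffs, x, p) for x in range(p)]
    return sum(1 for vx in values for vy in values if (vx - vy - h) % p == 0)


def curve_points_via_fibers(coeffs, p, h):
    """The same count written as the sum over a of M(a) * M(a - h)."""
    sizes = fiber_sizes(coeffs, p)
    return sum(sizes[a] * sizes[(a - h) % p] for a in range(p))


if __name__ == "__main__":
    coeffs, p = [0, 0, -2, 0, 1], 101    # f(x) = x^4 - 2x^2, illustrative choice
    for h in (1, 2, 5):
        print(h, curve_points_direct(coeffs, p, h), curve_points_via_fibers(coeffs, p, h), p)
```

Both counts agree exactly, and for $h \not\equiv 0$ they should stay within $O_f(\sqrt{p})$ of $p$, as the Riemann hypothesis for curves predicts.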

3.2. The case $k > 2$. As usual, we use the convention that $h_0 = 0$. Arguing as in the proof of Proposition 8 we find that the field extensions

$(L_{h_0} L_{h_1})/l(t),\ L_{h_2}/l(t),\ \ldots,\ L_{h_{k-1}}/l(t)$

are linearly disjoint since they have disjoint ramification. Hence there is an isomorphism

$\mathrm{Gal}(L_{h_0} L_{h_1} \cdots L_{h_{k-1}}/l(t)) \cong \mathrm{Gal}(L_{h_0} L_{h_1}/l(t)) \times \mathrm{Gal}(L_{h_2}/l(t)) \times \cdots \times \mathrm{Gal}(L_{h_{k-1}}/l(t))$

Putting $h' = (h_0, h_1)$ and arguing as in Lemma 9 we find that

$\frac{|\mathrm{Fix}_{k,h}|}{|\mathrm{Gal}(L^k/l(t))|} = \frac{|\mathrm{Fix}_{2,h'}|}{|\mathrm{Gal}(L_{h_0} L_{h_1}/l(t))|} \cdot \frac{1}{s_p^{k-2}} = C_2(h', p) \cdot \frac{1}{s_p^{k-2}}.$

By Lemma 11, $C_2(h', p) \le C_0/s_p$ and the proof is complete.

4. Proof of Theorem 3

For $h \in \mathbb{Z}^{k-1}$ fixed, it follows immediately from the Chinese Remainder Theorem that $N_k(h, q)$ is multiplicative in $q$. The following Lemma shows that we may assume that $q$ is a product of primes $p$ for which $f$ is not a permutation polynomial modulo $p$, and hence that $s_p$ is uniformly bounded away from 1 for all $p|q$.
Lemma 12. Given a square free integer $q$, write $q = q_1 q_2$ where

$q_1 = \prod_{p|q,\ |\Omega_p| < p} p, \qquad q_2 = \prod_{p|q,\ |\Omega_p| = p} p$

Then

$R_k(X, q) = R_k(X, q_1)$

Proof. If $p|q_2$ we have $s_p = p/|\Omega_p| = 1$ and $N_k(h, p) = p$ for all $h \in \mathbb{Z}^{k-1}$. Thus $s_q = s_{q_1} \cdot s_{q_2} = s_{q_1}$, and since for $h$ fixed, $N_k(h, q)$ is multiplicative, we find that $N_k(h, q) = N_k(h, q_1) \cdot q_2$. Thus

$R_k(X, q) = \frac{1}{|\Omega_q|} \sum_{h \in s_q X \cap \mathbb{Z}^{k-1}} N_k(h, q) = \frac{q_2}{|\Omega_q|} \sum_{h \in s_{q_1} X \cap \mathbb{Z}^{k-1}} N_k(h, q_1)$

$= \frac{1}{|\Omega_{q_1}|} \sum_{h \in s_{q_1} X \cap \mathbb{Z}^{k-1}} N_k(h, q_1) = R_k(X, q_1)$

□

We also note the following easy consequence of Theorem 1.

Lemma 13. Let $l$ be the largest integer such that $R_p - h_{i_1}, R_p - h_{i_2}, \ldots, R_p - h_{i_l}$ are pairwise disjoint for some choice of indices $0 \le i_1, i_2, \ldots, i_l \le k - 1$ (with the usual convention that $h_0 = 0$). Then

$N_k((h_1, h_2, \ldots, h_{k-1}), p) \le \frac{p}{s_p^l} + O_{f,k}(\sqrt{p})$

Proof. If $\{h'_1, h'_2, \ldots, h'_{l-1}\}$ is a subset of $\{h_1, h_2, \ldots, h_{k-1}\}$ then trivially

$N_k((h_1, h_2, \ldots, h_{k-1}), p) \le N_l((h'_1, h'_2, \ldots, h'_{l-1}), p)$

and the Lemma follows from Theorem 1. □

4.1. Some remarks on affine sets. We will partition $\mathbb{Z}^{k-1}$ according to the size of the bounds on $N_k(h, q) = \prod_{p|q} N_k(h, p)$ given by Theorem 1 and Proposition 2. In order to do this, we need to introduce some notation: By an affine set $L \subset \mathbb{Z}^{k-1}$ we mean an integer translate of a lattice $L' \subset \mathbb{Z}^{k-1}$. We then define the rank, respectively discriminant, of $L$ as the rank, respectively discriminant^5, of $L'$. Similarly, we define $\mathrm{codim}(L)$ as $k - 1$ minus the rank of $L$.

Let $R$ be the set of critical values of $f$, i.e.,

$R := \{f(\xi) : f'(\xi) = 0,\ \xi \in \overline{\mathbb{Q}}\}$

and recall that $R_p = \{f(\xi) : f'(\xi) = 0,\ \xi \in \overline{\mathbb{F}}_p\}$ is the set of critical values of $f$ modulo $p$. Let

$\tilde{R} := R - R = \{\alpha - \beta : \alpha, \beta \in R\},$

put

$\tilde{R}_\infty := \tilde{R} \cap \mathbb{Z},$

and let

$\tilde{R}_p := (R_p - R_p) \cap \mathbb{F}_p.$

^5 By the discriminant of $L' \subset \mathbb{Z}^{k-1}$ we mean the index of $L'$ in $\mathbb{Z}^{k-1}$.

If $(R_p + h_i) \cap (R_p + h_j) \ne \emptyset$ then $h_i - h_j \in \tilde{R}_p$, so the affine sets to be considered will be given by equations of the form

(9)  $h_i - h_j = r, \quad r \in \tilde{R}_\infty$

or congruences of the form

(10)  $h_i - h_j \equiv r_p \bmod p, \quad r_p \in \tilde{R}_p$

We note that the bounds given by Theorem 1 and Proposition 2 only depend on the congruence class of $h$, but we will treat the case of equality separately since $N_k(h, p)$ will be large for all $p|q$ if $h$ satisfies an equation of the form (9).

To ensure that the equations defining the affine sets are independent, we will need the following notions: Given

$E \subset \{(i, j) : 0 \le i < j \le k - 1\}$

we may associate a graph $G(E)$ on the set of vertices $\{0, 1, \ldots, k - 1\}$ by regarding $E$ as the set of edges, i.e., two nodes $i, j$ are connected by an edge if and only if $(i, j) \in E$. Let

$AG := \{E \subset \{(i, j) : 0 \le i < j \le k - 1\} : G(E) \text{ is acyclic}\}$

be the collection of edge sets whose associated graphs are acyclic.
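For small $k$ the collection of acyclic edge sets can be enumerated explicitly. The sketch below is an illustration with hypothetical function names: it tests acyclicity with a union-find structure and lists all edge sets on the vertices $\{0, 1, \ldots, k-1\}$ whose graph is a forest.

```python
from itertools import combinations


def is_acyclic(k, edges):
    """True when the graph on vertices 0..k-1 with the given edges has no cycle."""
    parent = list(range(k))

    def find(v):
        # Follow parent pointers to the component root, compressing the path.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for i, j in edges:
        root_i, root_j = find(i), find(j)
        if root_i == root_j:
            return False          # edge joins two vertices already connected
        parent[root_i] = root_j
    return True


def acyclic_edge_sets(k):
    """All subsets E of {(i, j) : 0 <= i < j <= k-1} with G(E) acyclic."""
    all_edges = [(i, j) for i in range(k) for j in range(i + 1, k)]
    return [edge_set
            for size in range(len(all_edges) + 1)
            for edge_set in combinations(all_edges, size)
            if is_acyclic(k, edge_set)]


if __name__ == "__main__":
    for k in (2, 3, 4, 5):
        print(k, len(acyclic_edge_sets(k)))
```

For $k = 3$ there are 7 acyclic edge sets (every subset of the three edges except the full triangle), and an acyclic edge set never has more than $k - 1$ edges, which matches the fact that it encodes at most $k - 1$ independent relations among $h_1, \ldots, h_{k-1}$.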
Given $E \in AG$ and a map $\alpha : E \to \tilde{R}_\infty$, define an affine set

$L(E, \alpha) := \{h \in \mathbb{Z}^{k-1} : h_i - h_j = \alpha((i, j)) \text{ for all } (i, j) \in E\}$

(with the usual convention that $h_0 = 0$). Note that $G(E)$ acyclic implies that the equations defining $L(E, \alpha)$ are independent. Further, given $E \in AG$, let

$\mathcal{L}(E) := \{L(E, \alpha) \text{ where } \alpha \text{ ranges over all maps } \alpha : E \to \tilde{R}_\infty\}$

be the collection of affine sets defined by independent relations between $h_i$ and $h_j$ for all $(i, j) \in E$. We note that $\mathcal{L}(\emptyset)$ contains exactly one element, namely the full lattice $L(\emptyset, -) = \mathbb{Z}^{k-1}$. Moreover, if $L \in \mathcal{L}(E)$, then (since we assume that $E \in AG$) $\mathrm{codim}(L) = |E|$, and if $h \in L$, then Proposition 2 will, for all $p|q$, at best give the bound

$N_k(h, p) \le C_0 \cdot \frac{p}{s_p^{k - |E|}} + O_{f,k}(\sqrt{p}).$

(The bound will not hold if the components of $h$ satisfy additional equations, i.e., if $h \in L'$ for some $L' \in \mathcal{L}(E')$ such that $E' \supsetneq E$.) Given $L(E, \alpha) \in \mathcal{L}(E)$, let

$L^\times(E, \alpha) := \{h \in L(E, \alpha) : h \notin L(E', \alpha') \text{ for all } E' \supsetneq E,\ \alpha' : E' \to \tilde{R}_\infty\}$

In particular, if $h \in L^\times(E, \alpha)$, the components of $h$ satisfy exactly $|E|$ independent equations of the form $h_i - h_j = r_{ij}$ where $r_{ij} \in \tilde{R}_\infty$.

We also need to keep track of similar relations, modulo $p$, between the components of $h$. Thus, given $E_p \in AG$ and $\alpha_p : E_p \to \tilde{R}_p$, define an affine set

$L_p(E_p, \alpha_p) := \{h \in \mathbb{Z}^{k-1} : h_i - h_j \equiv \alpha_p((i, j)) \bmod p \text{ for all } (i, j) \in E_p\}.$

We note that the rank of $L_p(E_p, \alpha_p)$ is $k - 1$ and that the discriminant of $L_p(E_p, \alpha_p)$ is $p^{|E_p|}$, and if $h \in L_p(E_p, \alpha_p)$, then Proposition 2 will at best give the bound

$N_k(h, p) \le C_0 \cdot \frac{p}{s_p^{k - |E_p|}} + O_{f,k}(\sqrt{p}).$

Now, given $E \in AG$, let

$\mathcal{L}_p(E) := \{L_p(E_p, \alpha_p) : E_p \in AG,\ \alpha_p : E_p \to \tilde{R}_p,\ E_p \cap E = \emptyset,\ E_p \cup E \in AG\}$

and for $L_p \in \mathcal{L}_p(E)$, let

$L_p^\times := \{h \in L_p : h \notin L'_p \text{ for all } L'_p = L_p(E'_p, \alpha'_p) \in \mathcal{L}_p(E) \text{ with } E'_p \supsetneq E_p\}$

If $h \in L^\times \cap L_p^\times$ for $L \in \mathcal{L}(E)$ and $L_p = L_p(E_p, \alpha_p) \in \mathcal{L}_p(E)$, then $(h_0, h_1, \ldots, h_{k-1}) = h$ satisfies exactly $|E|$ independent equations of the form $h_i - h_j = r_{ij}$ where $r_{ij} \in \tilde{R}_\infty$, and exactly $|E_p|$ independent congruences of the form $h_i - h_j \equiv r_{ij} \bmod p$ where $r_{ij} \in \tilde{R}_p$, and furthermore, there is no overlap between the equations and congruences. The reason for keeping track of equalities and congruences separately is that if $h \in L$ for $L \in \mathcal{L}(E)$ and $|E| > 0$, then the bounds on $N_k(h, p)$ given by Proposition 2 allow $N_k(h, p)$ to deviate quite a bit from its mean value for all $p|q$. On the other hand, if we let $c$ be the product of primes $p|q$ for which the bounds are bad because of congruence conditions, rather than equalities, then we can bound the size of $c$ (see Lemma 17). We can now partition $\mathbb{Z}^{k-1}$ according to the size of the bounds on $N_k(h, p)$ given by Theorem 1 and Proposition 2.

Lemma 14. Let $L = L(E, \alpha)$, $L_p = L_p(E_p, \alpha_p) \in \mathcal{L}_p(E)$, and assume that $h \in L^\times \cap L_p^\times$. If $|E| + |E_p| = 0$, then

$N_k(h, p) = s_p^{-k} \cdot p + O_{k,f}(p^{1/2}),$

whereas if $k > |E| + |E_p| > 0$, then

$N_k(h, p) \le C_0 \cdot s_p^{|E| + |E_p| - k} \cdot p + O_{k,f}(p^{1/2}),$

where $C_0 < 1$ is as in Proposition 2.
Proof. The first assertion follows immediately from Theorem 1 since $(R_p + h_i) \cap (R_p + h_j) \ne \emptyset$ implies that $h_i - h_j \in \tilde{R}_p$.

For the second assertion, we argue as follows: Since $h = (h_1, h_2, \ldots, h_{k-1}) \in L^\times \cap L_p^\times$ there are indices $i_1, i_2, \ldots, i_{k - |E| - |E_p|}$ such that $h_{i_1} \ne h_{i_2}$ and

$(R_p - h_{i_1} \cup R_p - h_{i_2}),\ R_p - h_{i_3},\ \ldots,\ R_p - h_{i_{k - |E| - |E_p|}}$

are pairwise disjoint. Putting

$h' = (h_{i_2} - h_{i_1},\ h_{i_3} - h_{i_1},\ \ldots,\ h_{i_{k - |E| - |E_p|}} - h_{i_1}),$

the result follows from the bound for $N_k(h', p)$ given by Proposition 2.

□

However, partitioning $\mathbb{Z}^{k-1}$ according to the size of $N_k(h, p)$ for individual prime factors $p|q$ is not quite enough; we need to partition $\mathbb{Z}^{k-1}$ according to the size of $N_k(h, q) = \prod_{p|q} N_k(h, p)$. Thus, let

$\mathcal{L}_c(E) := \{L \cap (\cap_{p|c} L_p) : L \in \mathcal{L}(E),\ \forall p|c\ \ L_p \in \mathcal{L}_p(E) \setminus L_p(\emptyset, -)\}$

(where $L_p(\emptyset, -) \in \mathcal{L}_p(E)$ is the maximal lattice, i.e., $L_p(\emptyset, -) = \mathbb{Z}^{k-1}$) and given

$L_c = L \cap (\cap_{p|c} L_p) \in \mathcal{L}_c(E)$

let

$L_c^\times := L^\times \cap (\cap_{p|c} L_p^\times) \cap (\cap_{p|\frac{q}{c}} L_p^\times(\emptyset, -))$

We can now partition $\mathbb{Z}^{k-1}$ into subsets $L_c^\times$, where $L_c \in \mathcal{L}_c(E)$, $E \in AG$, and $c|q$. Moreover, as an immediate consequence of the definitions and Lemma 14 we obtain the following:

Lemma 15. Assume that $L_c = L \cap (\cap_{p|c} L_p(E_p, \alpha_p)) \in \mathcal{L}_c(E)$ and that $h \in L_c^\times$. If $p \nmid c$, then

$N_k(h, p) = s_p^{-k} \cdot p + O_{k,f}(p^{1/2}).$

If $p | c$, then

$N_k(h, p) \le C_0 \cdot s_p^{|E| + |E_p| - k} \cdot p + O_{k,f}(p^{1/2}),$

where $C_0 < 1$ is as in Proposition 2.

Using the previous Lemma we can now bound sums of the form 

Lemma 16. If

$L_c = L \cap (\cap_{p|c} L_p(E_p, \alpha_p)) \in \mathcal{L}_c(E),$

then

\{h g s q x n 4 X }| < |{h g Sg x n l c }| « fcj/)X £*-i_L_ + ^i- 2 
Moreover, if $h \in L_c^\times$, then

^ « n + o^ap- 1 ' 2 )) n U ■ f + o, /(P -"- 

p|c p\ c 

In particular, 



:») E 



q/s„ 
hes a xriL* 



« s^c-^^ + h ■ ct q) n (i + o fcl/ (p- i/2 )) 

Sq C 

p|<? 

Proof. The first assertion follows from the Lipschitz principle^6 (e.g., see Lemma 16 in [14]) since $L_c$ is a translate of a lattice with discriminant (relative to $L$) divisible by $c$. The second assertion follows from Lemma 15. Thus

e ^ « n + °^ ,2) ) n + o W (p-^)) 

he s ,XnL* q/Sq p\c\ P J p\i 

+ j n + ^/(p- i/2 )) • n po + o kJ ( P -^)) 

Q P\c p|§ 

« cr (c) (^ + 4^) ■ ^ n c 1 + °^(^ i/2 )) 

p|<? 

□ 

Since the bound in (11) is not useful for large $c$, we will also need the following:

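Lemma 17. Let $d$ be the degree of the field extension $\mathbb{Q}(\tilde{R})/\mathbb{Q}$. If $L_c \in \mathcal{L}_c(E)$ for some $E \in AG$ and $s_q X \cap L_c^\times \ne \emptyset$ then

$c \ll_{X,R} s_q^{d \binom{k}{2} |\tilde{R}|}$

Moreover, there exists a constant $D$, only depending on $k$ and $f$, such that

$|\mathcal{L}_c(E)| \ll_{k,f} D^{\omega(c)}$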


^6 Actually, we have to be a little careful: if we embed $L$ into $\mathbb{Z}^{k-1-|E|}$ and apply the Lipschitz principle, there is an implicit constant in the bound that will depend on $L$. However, the estimate is uniform since $L$ can only be chosen in $O_k(1)$ ways.
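Proof. We first assume that all elements of $\tilde{R}$ are algebraic integers. Let $B$ be the ring of integers in $\mathbb{Q}(\tilde{R})$. For each prime $p|q$ choose a prime $\mathfrak{P}_p \subset B$ lying above $p$, so that we may regard any element in $\tilde{R}_p$ as the image of an element in $\tilde{R}$ under the reduction map $B \to B/\mathfrak{P}_p$. For $0 \le i < j \le k - 1$, $r \in \tilde{R}$, and $h \in L_c^\times$, let

$\gamma_{i,j,r}(h) := \prod_{p : h_i - h_j \equiv r \bmod \mathfrak{P}_p} p$

Then $c$ divides

$\prod_{0 \le i < j \le k-1,\ r \in \tilde{R} : h_i - h_j \ne r} \gamma_{i,j,r}(h)$

Since $h_i - h_j - r \equiv 0 \bmod \mathfrak{P}_p$ for all $p$ dividing $\gamma_{i,j,r}$, we find that $\gamma_{i,j,r}$ divides $N^{\mathbb{Q}(\tilde{R})}_{\mathbb{Q}}(h_i - h_j - r)$. Moreover, if $h \in s_q X$, then $|h_i - h_j| \ll_X s_q$, thus

$N^{\mathbb{Q}(\tilde{R})}_{\mathbb{Q}}(h_i - h_j - r) \ll_{f,X} s_q^d$

and hence

$c \le \prod_{0 \le i < j \le k-1,\ r \in \tilde{R} : h_i - h_j \ne r} N^{\mathbb{Q}(\tilde{R})}_{\mathbb{Q}}(h_i - h_j - r) \ll_{f,X} s_q^{d \binom{k}{2} |\tilde{R}|}$

(Note that $N^{\mathbb{Q}(\tilde{R})}_{\mathbb{Q}}(h_i - h_j - r) \ne 0$ since $h_i - h_j - r \ne 0$.)

In case $\tilde{R}$ contains elements that are not algebraic integers, we can find an integer $m$, only depending on $\tilde{R}$, such that all elements of $m \cdot \tilde{R} = \{m \cdot r : r \in \tilde{R}\}$ are algebraic integers, and apply the above argument to $m \cdot \tilde{R}$ and $mh$ (for primes $p$ not dividing $m$, but since $c$ is square free this just makes the constant worse by a power of $(c, m) \le m$, which is $O(1)$.)

The second assertion follows upon noting that there are $O_{k,f}(1)$ possible choices of $E_p$ and $\alpha_p$ for each $p|c$.

□

4.2. Conclusion. We can now write $\mathbb{Z}^{k-1}$ as a disjoint union of sets $L^\times$ where $L$ ranges over all elements in $\cup_{E \in AG} \mathcal{L}(E)$, and hence $R_k(X, q)$ equals

(12)  $\frac{1}{|\Omega_q|} \sum_{h \in s_q X \cap \mathbb{Z}^{k-1}} N_k(h, q) = \frac{1}{|\Omega_q|} \sum_{E \in AG} \sum_{L \in \mathcal{L}(E)} \sum_{h \in s_q X \cap L^\times} N_k(h, q)$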
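The term corresponding to $E = \emptyset$ in (12) will give the main contribution (note that if $E = \emptyset$, then $L = L(E, \cdot) = \mathbb{Z}^{k-1}$.) Let

\[ X' := \{\mathbf{h} \in X : h_i - h_j \neq 0 \text{ for } 0 \le i < j \le k-1\} \]

where we as usual use the convention that $h_0 = 0$. Then

\[ s_q X \cap L^\times = s_q X' \cap \mathbb{Z}^{k-1}. \]

Note that $X'$ is just $X$ with some hyperplanes removed, so if $X$ is convex, we can write $X'$ as a finite union of convex sets. We now rewrite (12) as follows:

\[ \frac{1}{|\Omega_q|} \sum_{\mathbf{h} \in s_q X \cap \mathbb{Z}^{k-1}} N_k(\mathbf{h}, q) = \frac{1}{|\Omega_q|} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} N_k(\mathbf{h}, q) + \mathrm{Error}_1 \]

where

\[ \mathrm{Error}_1 := \frac{1}{|\Omega_q|} \sum_{\substack{E \in AG \\ |E| > 0}} \sum_{L \in \mathcal{L}(E)} \sum_{\mathbf{h} \in s_q X \cap L^\times} N_k(\mathbf{h}, q) \]

and the main term is given by

\[ \frac{1}{|\Omega_q|} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} N_k(\mathbf{h}, q). \tag{13} \]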

We begin by showing that $\mathrm{Error}_1 = o(1)$ as $\omega(q) \to \infty$.
Lemma 18. As $\omega(q) \to \infty$,



r > = jh E E E AT fc (h, g )«c 



|B|>0 



Proof. Given $E \in AG$ with $|E| > 0$, we find that

( 14 ) ]^| E E 



LeC{E) hes q xnL> 



= ^E E E *wm 

which, by Lemmas 16 and 17, is

(is) «c a;(g) -n(i+o(p- i/2 )) ^ D-( C ) S fc-i C ' - a;(c) (-+-) 



Pl9 c l<? 

'9 
Now,



and, for any $\delta > 0$,



c 

<j(*)isi 



j_ yWc) fc-l^-^c) 1 V 

c|<7 

i(a)l«l 



„fc-l/^- w ( c ) 
S c °0 



c\q tig c\q 



c«s„ 



« -^km n (i + °(v/)) « 1 _, < ^ ho(1) 

Thus, taking $\delta = 1/(2d\binom{k}{2}|R|)$, we find that (15) is

« cff« n a + o( P -" 2 )) . ( + n (i + oft,-)) ) 



^( 9 )(l-o(l)) 
"0 



«^ (9) -n( i +°(^ i/2 ))= c7 o 
pi? 

Since there are 0(1) possible choices of I 6 f° r ^ fixed, and E 

ranges over a finite number of subsets, we find that (|14|) is Cq ■ 

□ 

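We proceed by rewriting the main term in terms of a divisor sum. For $p$ prime and $\mathbf{h} \in \mathbb{Z}^{k-1}$, let

\[ \epsilon_k(\mathbf{h}, p) := \frac{s_p^{k-1} \cdot N_k(\mathbf{h}, p)}{|\Omega_p|} - 1 \]

so that we may write

\[ N_k(\mathbf{h}, p) = \frac{|\Omega_p|}{s_p^{k-1}} \bigl(1 + \epsilon_k(\mathbf{h}, p)\bigr) \]

(recall that $s_p = p/|\Omega_p|$.) Further, for $d > 1$ a square free integer, put

\[ \epsilon_k(\mathbf{h}, d) = \prod_{p \mid d} \epsilon_k(\mathbf{h}, p) \]

and, to make $\epsilon_k$ multiplicative in the second parameter, set $\epsilon_k(\mathbf{h}, 1) = 1$ for all $\mathbf{h}$. Since $N_k(\mathbf{h}, q)$ is multiplicative, we then have

\[ N_k(\mathbf{h}, q) = \prod_{p \mid q} \frac{|\Omega_p|}{s_p^{k-1}} \bigl(1 + \epsilon_k(\mathbf{h}, p)\bigr) = \frac{|\Omega_q|}{s_q^{k-1}} \sum_{d \mid q} \epsilon_k(\mathbf{h}, d). \tag{16} \]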
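The identity (16) can be checked by brute force for small moduli. The following Python sketch is an illustration only: the helper names (`image`, `N_k`, `eps_prime`, `divisor_sum_side`) and the sample choice $f(x) = x^2$, $q = 15$ are assumptions made for this example and are not taken from the paper. Exact rational arithmetic is used so that both sides of (16) can be compared without rounding.

```python
from fractions import Fraction
from itertools import combinations
from math import prod


def image(f, q):
    """Image of f regarded as a map Z/qZ -> Z/qZ."""
    return {f(x) % q for x in range(q)}


def N_k(h, q, f):
    """Number of t mod q with t, t+h_1, ..., t+h_{k-1} all in the image of f mod q."""
    omega = image(f, q)
    return sum(1 for t in range(q)
               if t in omega and all((t + hi) % q in omega for hi in h))


def eps_prime(h, p, f):
    """epsilon_k(h, p) = s_p^(k-1) * N_k(h, p) / |Omega_p| - 1, with s_p = p / |Omega_p|."""
    size = len(image(f, p))
    s_p = Fraction(p, size)
    return s_p ** len(h) * N_k(h, p, f) / size - 1


def divisor_sum_side(h, primes, f):
    """Right-hand side of (16) for q equal to the product of the given distinct primes."""
    q = prod(primes)
    size = len(image(f, q))
    s_q = Fraction(q, size)
    total = Fraction(0)
    for r in range(len(primes) + 1):
        for subset in combinations(primes, r):
            # epsilon_k(h, d) for d = prod(subset); the empty subset gives epsilon_k(h, 1) = 1
            total += prod((eps_prime(h, p, f) for p in subset), start=Fraction(1))
    return Fraction(size) / s_q ** len(h) * total
```

For instance, with `f = lambda x: x * x` one can compare `divisor_sum_side((h,), [3, 5], f)` against `N_k((h,), 15, f)` for every residue `h` modulo 15; here `len(h)` plays the role of $k - 1$.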
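The following lemma shows that the average of $\epsilon_k(\mathbf{h}, d)$, over a full set of residues modulo $d$, equals zero if $d > 1$.

Lemma 19. If $d > 1$ then

\[ \sum_{\mathbf{h} \in (\mathbb{Z}/d\mathbb{Z})^{k-1}} \epsilon_k(\mathbf{h}, d) = 0. \]

Proof. Since $\epsilon_k(\mathbf{h}, d)$ is multiplicative it is enough to show that

\[ \sum_{\mathbf{h} \in (\mathbb{Z}/p\mathbb{Z})^{k-1}} \epsilon_k(\mathbf{h}, p) = 0 \]

for $p$ prime, and because

\[ N_k(\mathbf{h}, p) = \frac{|\Omega_p|}{s_p^{k-1}} \bigl(1 + \epsilon_k(\mathbf{h}, p)\bigr) \]

it is enough to show that

\[ \sum_{\mathbf{h} \in (\mathbb{Z}/p\mathbb{Z})^{k-1}} N_k(\mathbf{h}, p) = \frac{|\Omega_p|}{s_p^{k-1}} \, p^{k-1} = |\Omega_p|^k. \]

But $\sum_{\mathbf{h} \in (\mathbb{Z}/p\mathbb{Z})^{k-1}} N_k(\mathbf{h}, p)$ equals the number of $k$-tuples of elements from $\Omega_p$, and hence $\sum_{\mathbf{h} \in (\mathbb{Z}/p\mathbb{Z})^{k-1}} N_k(\mathbf{h}, p) = |\Omega_p|^k$. □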
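Lemma 19 for prime modulus can be verified numerically. The sketch below is a brute-force check under assumptions of this example only: the function names and the sample polynomials are hypothetical choices, not taken from the paper. It sums $\epsilon_k(\mathbf{h}, p)$ over all $\mathbf{h} \in (\mathbb{Z}/p\mathbb{Z})^{k-1}$ using exact fractions, so a result of exactly zero is meaningful.

```python
from fractions import Fraction
from itertools import product


def image(f, p):
    """Image of f regarded as a map Z/pZ -> Z/pZ."""
    return {f(x) % p for x in range(p)}


def N_k(h, omega, p):
    """Number of t mod p with t, t+h_1, ..., t+h_{k-1} all in omega."""
    return sum(1 for t in range(p)
               if t in omega and all((t + hi) % p in omega for hi in h))


def eps_k(h, omega, p):
    """epsilon_k(h, p) = s_p^(k-1) * N_k(h, p) / |Omega_p| - 1, with s_p = p / |Omega_p|."""
    s_p = Fraction(p, len(omega))
    return s_p ** len(h) * N_k(h, omega, p) / len(omega) - 1


def eps_sum(f, p, k):
    """Sum of epsilon_k(h, p) over all h in (Z/pZ)^(k-1)."""
    omega = image(f, p)
    return sum(eps_k(h, omega, p) for h in product(range(p), repeat=k - 1))
```

Calling `eps_sum(lambda x: x * x, 7, 3)` runs the check for squares modulo 7 with $k = 3$; the cost grows like $p^k$, so only small primes are practical.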

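We will also need the following bound:
Lemma 20. We have

\[ \sum_{\mathbf{h} \in (\mathbb{Z}/d\mathbb{Z})^{k-1}} |\epsilon_k(\mathbf{h}, d)| \ll d^{k - 3/2 + o(1)}. \]

Proof. Since the sum is multiplicative in $d$, it is enough to show that

\[ \sum_{\mathbf{h} \in (\mathbb{Z}/p\mathbb{Z})^{k-1}} |\epsilon_k(\mathbf{h}, p)| \ll p^{k - 3/2} \]

for $p$ prime. By Theorem ^ $|\epsilon_k(\mathbf{h}, p)| \ll p^{-1/2}$ for all but $O(p^{k-2})$ residues modulo $p$, and for the remaining residues we have $|\epsilon_k(\mathbf{h}, p)| = O_{k,f}(1)$. Thus

\[ \sum_{\mathbf{h} \in (\mathbb{Z}/p\mathbb{Z})^{k-1}} |\epsilon_k(\mathbf{h}, p)| \ll p^{k-1} p^{-1/2} + p^{k-2} \ll p^{k - 3/2}. \]

□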
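The size of the sum in Lemma 20 can be explored numerically for small primes. The following sketch is an illustration under assumptions: the function names and the choice $f(x) = x^2$ are hypothetical and serve only this example. It computes $\sum_{\mathbf{h}} |\epsilon_k(\mathbf{h}, p)|$ exactly and divides by $p^{k-3/2}$, the quantity that the lemma asserts is bounded.

```python
from fractions import Fraction
from itertools import product


def abs_eps_sum(f, p, k):
    """Sum of |epsilon_k(h, p)| over all h in (Z/pZ)^(k-1), returned as a float."""
    omega = {f(x) % p for x in range(p)}
    size = len(omega)
    scale = Fraction(p, size) ** (k - 1) / size  # s_p^(k-1) / |Omega_p|
    total = Fraction(0)
    for h in product(range(p), repeat=k - 1):
        count = sum(1 for t in range(p)
                    if t in omega and all((t + hi) % p in omega for hi in h))
        total += abs(scale * count - 1)
    return float(total)


def normalized(f, p, k):
    """Ratio of the sum to p^(k - 3/2); Lemma 20 says this stays bounded as p grows."""
    return abs_eps_sum(f, p, k) / p ** (k - 1.5)
```

Evaluating `normalized(lambda x: x * x, p, 2)` for a few odd primes `p` gives a quick sanity check; for $k \ge 3$ the brute-force cost grows like $p^k$.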

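We now find that the main term (13) equals

\[ \frac{1}{|\Omega_q|} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} N_k(\mathbf{h}, q) = \frac{1}{s_q^{k-1}} \sum_{d \mid q} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} \epsilon_k(\mathbf{h}, d) = \frac{1}{s_q^{k-1}} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} 1 + \mathrm{Error}_2 \]

where

\[ \mathrm{Error}_2 := \frac{1}{s_q^{k-1}} \sum_{\substack{d \mid q \\ d > 1}} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} \epsilon_k(\mathbf{h}, d) \]

and the modified main term is

\[ \frac{1}{s_q^{k-1}} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} 1 = \frac{1}{s_q^{k-1}} \bigl( \mathrm{vol}(s_q X') + O(s_q^{k-2}) \bigr) = \mathrm{vol}(X) + O(1/s_q). \]

We conclude by showing that $\mathrm{Error}_2 = o(1)$ as $s_q \to \infty$.
Lemma 21. As $s_q \to \infty$, we have

\[ \mathrm{Error}_2 = \frac{1}{s_q^{k-1}} \sum_{\substack{d \mid q \\ d > 1}} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} \epsilon_k(\mathbf{h}, d) \ll s_q^{-1/2 + o(1)}. \tag{17} \]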

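Proof. In order to show that $\mathrm{Error}_2$ is small, we split the divisor sum in two parts according to the size of $d$.

Small $d$: We first consider $d < s_q^T$ where $T \in (0, 1)$ is to be chosen later. A point $\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}$ is contained in a unique cube $C_{\mathbf{h},d} \subset \mathbb{R}^{k-1}$ of the form

\[ C_{\mathbf{h},d} = \{(x_1, x_2, \ldots, x_{k-1}) : d t_i \le x_i < d(t_i + 1),\ t_i \in \mathbb{Z},\ i = 1, 2, \ldots, k-1\}. \]

We say that $\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}$ is a $d$-interior point of $s_q X'$ if $C_{\mathbf{h},d} \subset s_q X'$, and if $C_{\mathbf{h},d}$ intersects the boundary of $s_q X'$, we say that $\mathbf{h}$ is a $d$-boundary point of $s_q X'$.

By Lemma 19 the sum over the $d$-interior points is zero, and hence

\[ \frac{1}{s_q^{k-1}} \sum_{\substack{d \mid q \\ 1 < d < s_q^T}} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} \epsilon_k(\mathbf{h}, d) = \frac{1}{s_q^{k-1}} \sum_{\substack{d \mid q \\ 1 < d < s_q^T}} \sum_{\substack{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1} \\ \mathbf{h} \text{ is } d\text{-boundary point}}} \epsilon_k(\mathbf{h}, d). \tag{18} \]

Since $s_q X'$ is a union of convex sets, the number of cubes $C_{\mathbf{h},d}$ intersecting the boundary of $s_q X'$ is $\ll (s_q/d)^{k-2}$, and hence (18) is

\[ \ll \frac{1}{s_q^{k-1}} \sum_{\substack{d \mid q \\ 1 < d < s_q^T}} (s_q/d)^{k-2} \sum_{\mathbf{h} \in (\mathbb{Z}/d\mathbb{Z})^{k-1}} |\epsilon_k(\mathbf{h}, d)| \]

\[ = \frac{1}{s_q} \sum_{\substack{d \mid q \\ 1 < d < s_q^T}} \frac{1}{d^{k-2}} \sum_{\mathbf{h} \in (\mathbb{Z}/d\mathbb{Z})^{k-1}} |\epsilon_k(\mathbf{h}, d)| \tag{19} \]

which by Lemma 20 is, for any $\alpha > 1/2$,

\[ \ll \frac{1}{s_q} \sum_{\substack{d \mid q \\ 1 < d < s_q^T}} d^{1/2 + o(1)} \le s_q^{\alpha T - 1} \sum_{d \mid q} d^{1/2 - \alpha + o(1)} \ll s_q^{\alpha T - 1 + o(1)} \]

since

\[ \sum_{d \mid q} d^{-\varepsilon} = \prod_{p \mid q} (1 + p^{-\varepsilon}) = s_q^{o(1)} \]

if $\varepsilon > 0$ (recall that $s_p$ is assumed to be uniformly bounded away from 1 and $s_q = \prod_{p \mid q} s_p$.)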
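The identity $\sum_{d \mid q} d^{-\varepsilon} = \prod_{p \mid q} (1 + p^{-\varepsilon})$ for square free $q$ used above can be confirmed numerically. The short sketch below is only an illustration; the function names and the sample primes are assumptions of this example. It enumerates the divisors of a square free $q$ as products of subsets of its prime factors.

```python
from itertools import combinations
from math import prod, isclose


def divisor_sum(primes, e):
    """Sum of d^(-e) over all divisors d of q = product of the given distinct primes."""
    total = 0.0
    for r in range(len(primes) + 1):
        for subset in combinations(primes, r):
            total += prod(subset) ** (-e)  # prod(()) == 1 handles d = 1
    return total


def euler_product(primes, e):
    """Product of (1 + p^(-e)) over the given primes."""
    return prod(1 + p ** (-e) for p in primes)
```

Comparing `divisor_sum([2, 3, 5, 7, 11], 0.25)` with `euler_product([2, 3, 5, 7, 11], 0.25)` via `math.isclose` checks the identity up to floating-point rounding.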

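Large $d$: We now consider

\[ \frac{1}{s_q^{k-1}} \sum_{\substack{d \mid q \\ d \ge s_q^T}} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} \epsilon_k(\mathbf{h}, d). \tag{20} \]

Given $\mathbf{h}$ and $d$, let $c$ be the largest divisor of $d$ such that $\mathbf{h} \in L_c$ for some $L_c \in \mathcal{L}_c(E)$. Then

\[ \epsilon_k(\mathbf{h}, d) \ll \frac{1}{(d/c)^{1/2 - o(1)}} \]

by Lemma 10. Hence, for $E \in AG$ fixed,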

£ £ e fc (h,rf)«^ J2 E i^( h ' rf )i 
< E ( d / c u/2-o(i) E E 1 

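which by Lemmas 16 and 17 is

\[ \ll s_q^{k-1} \cdot d^{-1/2 + o(1)} \cdot \sum_{\substack{c \mid d \\ c \ll s_q^{d\binom{k}{2}|R|}}} c^{1/2 - o(1)} D^{\omega(c)} \left( \frac{1}{c} + \frac{1}{s_q} \right). \tag{21} \]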

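Now,

\[ \sum_{\substack{c \mid d \\ c \ll s_q^{d\binom{k}{2}|R|}}} D^{\omega(c)} c^{-1/2 - o(1)} \ll \sum_{c \mid d} c^{-1/2 + o(1)} \ll s_q^{o(1)} \]

and similarly

\[ \sum_{\substack{c \mid d \\ c \ll s_q^{d\binom{k}{2}|R|}}} D^{\omega(c)} c^{1/2 - o(1)} \ll \sum_{\substack{c \mid d \\ c \ll s_q^{d\binom{k}{2}|R|}}} c^{1/2 + o(1)}. \]

Thus (20) is

\[ \ll \frac{1}{s_q^{k-1}} \sum_{\substack{d \mid q \\ d \ge s_q^T}} \frac{s_q^{k-1}}{d^{1/2 - o(1)}} \Biggl( s_q^{o(1)} + \frac{1}{s_q} \sum_{\substack{c \mid d \\ c \ll s_q^{d\binom{k}{2}|R|}}} c^{1/2 + o(1)} \Biggr) \]

\[ \ll s_q^{o(1)} \sum_{\substack{d \mid q \\ d \ge s_q^T}} \frac{1}{d^{1/2 - o(1)}} + \frac{1}{s_q} \sum_{\substack{d \mid q \\ d \ge s_q^T}} \frac{1}{d^{1/2 - o(1)}} \sum_{\substack{c \mid d \\ c \ll s_q^{d\binom{k}{2}|R|}}} c^{1/2 + o(1)}. \tag{22} \]

Now, for any $\beta \in (0, 1/2)$,

\[ \sum_{\substack{d \mid q \\ d \ge s_q^T}} d^{-1/2 + o(1)} \le \sum_{d \mid q} d^{-1/2 + o(1)} \left( \frac{d}{s_q^T} \right)^{\beta} = s_q^{-\beta T} \sum_{d \mid q} d^{\beta - 1/2 + o(1)} \ll s_q^{-\beta T + o(1)}. \]

Similarly, for any $\gamma > 0$,

\[ \sum_{\substack{c \mid d \\ c \ll s_q^{d\binom{k}{2}|R|}}} c^{1/2 + o(1)} \ll s_q^{\gamma d\binom{k}{2}|R|} \sum_{c \mid d} c^{1/2 - \gamma + o(1)} \ll s_q^{\gamma d\binom{k}{2}|R|} \, d^{1/2 - \gamma + o(1)} \]

and thus

\[ \sum_{d \mid q} \frac{1}{d^{1/2 - o(1)}} \sum_{\substack{c \mid d \\ c \ll s_q^{d\binom{k}{2}|R|}}} c^{1/2 + o(1)} \ll s_q^{\gamma d\binom{k}{2}|R| + o(1)} \sum_{d \mid q} d^{-\gamma + o(1)} \ll s_q^{\gamma d\binom{k}{2}|R| + o(1)}. \]

Hence (22) is

\[ \ll s_q^{-\beta T + o(1)} + s_q^{-1 + \gamma d\binom{k}{2}|R| + o(1)} \ll s_q^{-1/2 + o(1)} \]

if we take $T = 1 - o(1)$, $\beta = 1/(2T) - o(1)$, and $\gamma = 1/(2d\binom{k}{2}|R|)$. Thus, with $\alpha = 1/2 + o(1)$ (to bound the contribution from small $d$), we find that

\[ \mathrm{Error}_2 = \frac{1}{s_q^{k-1}} \sum_{\substack{d \mid q \\ d > 1}} \sum_{\mathbf{h} \in s_q X' \cap \mathbb{Z}^{k-1}} \epsilon_k(\mathbf{h}, d) \ll s_q^{-1/2 + o(1)}. \]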

□ 